1. Introduction



In a finite population of size N, sampling without replacement with unequal inclusion probabilities 
and fixed sample size is not straightforward, but there exist several sampling designs that satisfy 
these properties (see Brewer and Hanif (1983) for a review). Rejective sampling, which is also called
maximum entropy sampling or conditional Poisson sampling, is one possibility, introduced by Hájek
(1964). If n denotes the fixed sample size, the n units are drawn independently with probabilities
that may vary from unit to unit, and samples in which the units are not all distinct are rejected. In
the particular case of equal drawing probabilities, rejective sampling coincides with simple random 
sampling without replacement. Rejective sampling with size n can also be regarded as Poisson 
sampling conditionally on the sample size being equal to n. The unconditional Poisson design can 
be easily implemented by drawing N independently distributed Bernoulli random variables with 
different probabilities of success, but it has the disadvantage of working with a random sample size. 
The conditional Poisson design can also be interpreted as a maximum entropy sampling design for 
a fixed sample size and a given set of first order inclusion probabilities. 
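
As an illustration of the conditional Poisson description above, the following minimal sketch draws Poisson samples until one of size n is obtained. The function name `rejective_sample`, the cap `max_tries` and the probabilities used are assumptions made for this example only; they do not come from the literature cited here.

```python
import random

def rejective_sample(p, n, rng=None, max_tries=100_000):
    """Draw one rejective sample of size n as a list of 0-based unit indices.

    Poisson samples (independent Bernoulli(p_i) draws) are repeated until one
    of size n appears; max_tries is a safety cap added for this sketch.
    """
    rng = rng or random.Random()
    for _ in range(max_tries):
        sample = [i for i, p_i in enumerate(p) if rng.random() < p_i]
        if len(sample) == n:
            return sample
    raise RuntimeError("no Poisson sample of size n within max_tries draws")

p = [0.2, 0.5, 0.8, 0.5]          # illustrative probabilities summing to n = 2
sample = rejective_sample(p, 2, random.Random(0))
```

Plain rejection is only practical when the probability that a Poisson sample has size n is not too small; the algorithms of Chen, Dempster and Liu (1994) and Deville (2000) discussed below avoid the rejection step.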

Rejective sampling has been extensively studied in the literature. Hájek (1964, 1981) derives
an approximation of the joint inclusion probabilities in terms of first order inclusion probabilities.
By showing that the maximum entropy design belongs to a parametric exponential family,
Chen, Dempster and Liu (1994) give a recursive expression of the joint inclusion probabilities and
propose a new algorithm. This algorithm has been improved by Deville (2000), who gives another
expression for the joint inclusion probabilities. Using the results in Chen, Dempster and Liu
(1994), Qualité (2008) proves that the variance of the well-known unbiased Horvitz-Thompson
estimator for rejective sampling is smaller than the variance of the Hansen-Hurwitz estimator for
multinomial sampling. Several estimators of the variance of the Horvitz-Thompson estimator have
also been proposed; see Matei and Tillé (2005) for a comparison by means of a large simulation
study. The conditional Poisson sampling scheme is not only of interest in the survey sampling
field, but also in the context of case-control studies or survival analysis; see Chen (2000).

The purpose of the present article is to generalize the result given in Hájek (1964) and Hájek
(1981), obtained for the first and second order inclusion probabilities of rejective sampling, to
inclusion probabilities of any order and also to provide a more precise remainder term. The proof
of our result is along the lines of the proof by Hájek (1981) using Edgeworth expansions and leads
to approximations that are valid when N, n and N - n are large enough. One interesting application
of our result is that it enables us to show that rejective sampling satisfies the assumptions needed
for the consistency and the asymptotic normality of some complex estimators, such as the ones
defined in Breidt and Opsomer (2000), Breidt et al. (2007), Cardot et al. (2010) or Wang (2009).
Such assumptions involve conditions on correlations up to order four, which are difficult to check for
complex sampling designs that go beyond simple random sampling without replacement or Poisson
sampling. Our result implies that the rejective sampling design also satisfies these conditions.

In the case-control context, Arratia, Goldstein and Langholz (2005) consider rejective sampling
and also give approximations of higher order correlations. Their approach and the assumptions they
need to derive their results are different from the ones we consider in the present paper. Instead of
using Edgeworth expansions, they consider an expansion that involves the characteristic function.
Their results are obtained using a condition which is sufficient, but not necessary, to derive our
expansion. In view of this, we provide an example of a rejective sampling design that does not satisfy
the condition in Arratia, Goldstein and Langholz (2005), but does satisfy our weaker assumption.
Moreover, Arratia et al. do not give an explicit approximation formula for higher order inclusion
probabilities in rejective sampling, whereas we do provide such an approximation, which may be
of interest in itself.

The paper is organized as follows: in Section 2 we introduce notations and state our main 
result which is Theorem 1. In Section 3, we apply this result and illustrate that rejective sampling 
satisfies conditions on higher order correlations imposed in the recent literature to derive several 
asymptotic results. Detailed proofs are provided in Section 4. 

2. Notations and main result 

In this paper, we use the first description of rejective sampling by Hájek (1981), namely as Poisson
sampling conditionally on the sample size being equal to n. Let us denote by U the population of
size N. Let $0 < p_1, p_2, \ldots, p_N < 1$ be a sequence of real numbers such that $p_1 + p_2 + \cdots + p_N = n$.
The Poisson sampling design with parameters $p_1, p_2, \ldots, p_N$ is such that for any sample s, the
probability of s is

\[ P(s) = \prod_{i \in s} p_i \prod_{i \notin s} (1 - p_i). \]

The corresponding rejective sampling design is such that the probability of a sample s is

\[ P_{RS}(s) = \begin{cases} c \prod_{i \in s} p_i \prod_{i \notin s} (1 - p_i) & \text{if size } s = n, \\ 0 & \text{otherwise,} \end{cases} \tag{2.1} \]

where c is a constant such that $\sum_s P_{RS}(s) = 1$. We refer the reader to Hájek (1981) for more
details.
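
For a very small population, the design (2.1) can be tabulated by brute force, which may help to fix ideas. The sketch below is only an illustration: the helper name `rejective_design` and the four probabilities are assumptions of this example, and full enumeration is of course not feasible for realistic N.

```python
from itertools import combinations
from math import prod

def rejective_design(p, n):
    """Map each size-n sample (tuple of 0-based unit indices) to its probability under (2.1)."""
    N = len(p)
    weights = {}
    for s in combinations(range(N), n):
        inside = prod(p[i] for i in s)
        outside = prod(1.0 - p[i] for i in range(N) if i not in s)
        weights[s] = inside * outside
    total = sum(weights.values())   # Poisson probability of size n, so c = 1 / total
    return {s: w / total for s, w in weights.items()}

p = [0.2, 0.4, 0.6, 0.8]            # illustrative parameters with p_1 + ... + p_N = n = 2
design = rejective_design(p, 2)
# First order inclusion probabilities; they sum to n but differ from the p_i in general.
pi = [sum(prob for s, prob in design.items() if i in s) for i in range(len(p))]
```

With equal parameters, every sample of size n receives the same probability, which is the simple random sampling case mentioned in the introduction.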

The inclusion probabilities of order k under this sampling scheme are denoted as

\[ \pi_{i_1, i_2, \ldots, i_k} = P_{RS}(i_1 \in s, i_2 \in s, \ldots, i_k \in s) \]

for any $\{i_1, i_2, \ldots, i_k\} \subset \{1, 2, \ldots, N\}$. Our purpose is to obtain an expansion of inclusion
probabilities of any order. Theorem 7.4 in Hájek (1981), see also Theorem 5.2 in Hájek (1964), provides
such an expansion for inclusion probabilities of order two, i.e.,

\[ \pi_{ij} = \pi_i \pi_j \left[ 1 - d^{-1} (1 - \pi_i)(1 - \pi_j) + o(d^{-1}) \right], \quad \text{as } d \to \infty, \tag{2.2} \]

uniformly in i, j such that $1 \le i \ne j \le N$, where

\[ d = \sum_{i=1}^{N} p_i (1 - p_i). \tag{2.3} \]

We will obtain an extension of (2.2) and prove that a similar expansion holds for inclusion
probabilities of higher order.

Our approach is along the lines of the method used in Hájek (1981). Consider Poisson sampling
with parameters $p_1, p_2, \ldots, p_N$ and denote as P the corresponding probability measure on the
set of samples under this sampling scheme. For i = 1, 2, ..., N, we denote as $I_i$ the indicator of
inclusion of unit i, that is

\[ I_i = 1(i \in s) = \begin{cases} 1 & \text{if } i \in s, \\ 0 & \text{otherwise.} \end{cases} \]

For every i = 1, 2, ..., N, the indicator $I_i$ is a Bernoulli random variable with parameter $p_i$. Define

\[ K = \text{size } s = I_1 + I_2 + \cdots + I_N. \tag{2.4} \]

Note that the expectation and the variance of K satisfy $E_P(K) = n$ and $V_P(K) = d$. By Bayes'
formula and by independence of the $I_i$'s under Poisson sampling, the inclusion probability
$\pi_{i_1, i_2, \ldots, i_k}$ can be written as

\[ \pi_{i_1, i_2, \ldots, i_k} = P(I_{i_1} = I_{i_2} = \cdots = I_{i_k} = 1 \mid K = n) = p_{i_1} p_{i_2} \cdots p_{i_k} \, \frac{P(K = n \mid I_{i_1} = I_{i_2} = \cdots = I_{i_k} = 1)}{P(K = n)}. \tag{2.5} \]
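
Formula (2.5) also gives a direct way to compute inclusion probabilities numerically, since both probabilities on the right-hand side are point probabilities of sums of independent Bernoulli variables. The following sketch is an illustration under our own naming (`poisson_binomial_pmf`, `inclusion_probability`), with 0-based unit indices; it is not the algorithm of Chen, Dempster and Liu (1994) or of Deville (2000).

```python
def poisson_binomial_pmf(p):
    """Return [P(K = 0), ..., P(K = len(p))] for K a sum of independent Bernoulli(p_i)."""
    pmf = [1.0]
    for p_i in p:
        nxt = [0.0] * (len(pmf) + 1)
        for size, mass in enumerate(pmf):
            nxt[size] += mass * (1.0 - p_i)      # unit not selected
            nxt[size + 1] += mass * p_i          # unit selected
        pmf = nxt
    return pmf

def inclusion_probability(p, n, units):
    """Inclusion probability of the distinct 0-based indices in `units`, computed from (2.5)."""
    units = set(units)
    k = len(units)
    if k > n:
        return 0.0
    rest = [p_j for j, p_j in enumerate(p) if j not in units]
    # Given I_i = 1 for all i in units, K = n means the other units contribute n - k.
    numerator = poisson_binomial_pmf(rest)[n - k]
    denominator = poisson_binomial_pmf(p)[n]
    product = 1.0
    for i in units:
        product *= p[i]
    return product * numerator / denominator
```

With equal parameters the result can be compared with simple random sampling without replacement, for which the second order inclusion probability is n(n-1)/(N(N-1)).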

The next step is to use Edgeworth expansions for the probabilities of K. This leads to the next
lemma.

Lemma 1. Consider Poisson sampling with parameters $p_1, p_2, \ldots, p_N$, such that $p_1 + p_2 + \cdots + p_N = n$,
with corresponding probability measure P on the set of samples. Let d and K be defined in (2.3)
and (2.4), respectively. Then, for all $A_k = \{i_1, \ldots, i_k\} \subset \{1, 2, \ldots, N\}$, $k \ge 1$, it holds that
if $d \to \infty$, then

\[ P(K = n) = (2\pi d)^{-1/2} \left\{ 1 + c_1 d^{-1} + O(d^{-2}) \right\}, \]
\[ P(K = n \mid I_{i_1} = \cdots = I_{i_k} = 1) = (2\pi d)^{-1/2} \left\{ 1 + c_2 d^{-1} + O(d^{-2}) \right\}, \]

where

\[ c_1 = \frac{1}{8} \left( 1 - 6\,\overline{p(1-p)} \right) - \frac{5}{24} (1 - 2\bar{p})^2, \qquad c_2 = \frac{1}{2} \left( B_2 - (B_1 - k)^2 \right) - \frac{1}{2} (B_1 - k)(1 - 2\bar{p}) + c_1, \tag{2.6} \]

with

\[ \bar{p} = d^{-1} \sum_{i=1}^{N} p_i^2 (1 - p_i), \quad \overline{p(1-p)} = d^{-1} \sum_{i=1}^{N} p_i^2 (1 - p_i)^2, \quad B_1 = \sum_{j \in A_k} p_j, \quad B_2 = \sum_{j \in A_k} p_j (1 - p_j). \tag{2.7} \]

The proof of the lemma is provided in Section 4. We are now in the position to formulate our 
main result. 
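
The quality of the two expansions in Lemma 1 can be examined numerically by comparing them with exact point probabilities. The sketch below does so for a hypothetical two-stratum population chosen for this example only. The averages `p_bar` and `pq_bar` are computed so that the third and fourth cumulants of K equal d(1 - 2 p_bar) and d(1 - 6 pq_bar), as used in the proof in Section 4.1; the helper `poisson_binomial_pmf` and all variable names are ours.

```python
from math import pi, sqrt

def poisson_binomial_pmf(p):
    """Return [P(K = 0), ..., P(K = len(p))] for K a sum of independent Bernoulli(p_i)."""
    pmf = [1.0]
    for p_i in p:
        nxt = [0.0] * (len(pmf) + 1)
        for size, mass in enumerate(pmf):
            nxt[size] += mass * (1.0 - p_i)
            nxt[size + 1] += mass * p_i
        pmf = nxt
    return pmf

# Hypothetical population (not from the paper): 200 units with p = 0.3 and 200 with p = 0.6.
p = [0.3] * 200 + [0.6] * 200
n = 180                                   # the p_i sum to 180
d = sum(x * (1 - x) for x in p)
p_bar = sum(x * x * (1 - x) for x in p) / d
pq_bar = sum((x * (1 - x)) ** 2 for x in p) / d
c1 = (1 - 6 * pq_bar) / 8 - 5 * (1 - 2 * p_bar) ** 2 / 24

exact = poisson_binomial_pmf(p)[n]        # exact P(K = n)
normal_only = 1 / sqrt(2 * pi * d)        # leading term (2 pi d)^(-1/2)
edgeworth = normal_only * (1 + c1 / d)    # first expansion of Lemma 1 without the remainder

# Conditional version for an arbitrary set A_k of k = 3 units (0-based indices).
A = [0, 1, 200]
k = len(A)
B1 = sum(p[j] for j in A)
B2 = sum(p[j] * (1 - p[j]) for j in A)
c2 = (B2 - (B1 - k) ** 2) / 2 - (B1 - k) * (1 - 2 * p_bar) / 2 + c1
rest = [x for j, x in enumerate(p) if j not in A]
exact_cond = poisson_binomial_pmf(rest)[n - k]   # exact P(K = n | I_i = 1 for all i in A)
edgeworth_cond = normal_only * (1 + c2 / d)
```

In such an experiment one would expect the relative errors of `edgeworth` and `edgeworth_cond` to be markedly smaller than those of the leading term alone, in line with the O(d^{-2}) remainders.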

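Theorem 1. For $k \ge 1$, let $A_k = \{i_1, i_2, \ldots, i_k\} \subset \{1, \ldots, N\}$. Under rejective sampling (2.1),
the following approximations hold as $d \to \infty$, where d is defined by (2.3).

(i) For all $k \ge 2$,

\[ \pi_{i_1, i_2, \ldots, i_k} = \pi_{i_1} \pi_{i_2} \cdots \pi_{i_k} \left( 1 - d^{-1} \sum_{i, j \in A_k : i < j} (1 - p_i)(1 - p_j) + O(d^{-2}) \right), \tag{2.8} \]

where $O(d^{-2})$ holds uniformly in $i_1, i_2, \ldots, i_k$.

(ii) For all $k \ge 2$,

\[ \pi_{i_1, i_2, \ldots, i_k} = \pi_{i_1} \pi_{i_2} \cdots \pi_{i_k} \left( 1 - d^{-1} \sum_{i, j \in A_k : i < j} (1 - \pi_i)(1 - \pi_j) + O(d^{-2}) \right), \tag{2.9} \]

where $O(d^{-2})$ holds uniformly in $i_1, i_2, \ldots, i_k$.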

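Proof. From Lemma 1, we find

\[ \frac{P(K = n \mid I_{i_1} = \cdots = I_{i_k} = 1)}{P(K = n)} = \frac{1 + c_2 d^{-1} + O(d^{-2})}{1 + c_1 d^{-1} + O(d^{-2})} = 1 + (c_2 - c_1) d^{-1} + O(d^{-2}). \]

Together with (2.5) it follows that for all $k \ge 1$,

\[ \begin{aligned} \pi_{i_1, i_2, \ldots, i_k} &= p_{i_1} p_{i_2} \cdots p_{i_k} \left\{ 1 + (c_2 - c_1) d^{-1} + O(d^{-2}) \right\} \\ &= p_{i_1} p_{i_2} \cdots p_{i_k} \left\{ 1 + \frac{1}{2d} \left[ \sum_{j \in A_k} p_j (1 - p_j) - \Big( \sum_{j \in A_k} p_j - k \Big)^2 \right] - \frac{1 - 2\bar{p}}{2d} \Big( \sum_{j \in A_k} p_j - k \Big) + O(d^{-2}) \right\}. \end{aligned} \tag{2.10} \]

Applying (2.10) to the case k = 1 yields that the first order inclusion probabilities satisfy

\[ p_i = \pi_i \left( 1 - d^{-1} (p_i - \bar{p})(1 - p_i) + O(d^{-2}) \right), \tag{2.11} \]

and as a consequence,

\[ p_{i_1} p_{i_2} \cdots p_{i_k} = \pi_{i_1} \pi_{i_2} \cdots \pi_{i_k} \left\{ 1 - d^{-1} \sum_{j \in A_k} (p_j - \bar{p})(1 - p_j) + O(d^{-2}) \right\}. \]

Combining this with (2.10) yields

\[ \pi_{i_1, i_2, \ldots, i_k} = \pi_{i_1} \pi_{i_2} \cdots \pi_{i_k} \left\{ 1 + a d^{-1} + O(d^{-2}) \right\}, \]

where the contribution to terms of order $d^{-1}$ is

\[ \begin{aligned} a &= \frac{1}{2} \sum_{i \in A_k} p_i (1 - p_i) - \frac{1}{2} \Big( \sum_{j \in A_k} p_j - k \Big)^2 - \frac{1 - 2\bar{p}}{2} \Big( \sum_{i \in A_k} p_i - k \Big) - \sum_{i \in A_k} (p_i - \bar{p})(1 - p_i) \\ &= -\frac{1}{2} \sum_{i \in A_k} p_i (1 - p_i) - \frac{1}{2} \Big( \sum_{i \in A_k} (1 - p_i) \Big)^2 + \frac{1}{2} \sum_{j \in A_k} (1 - p_j) \\ &= \frac{1}{2} \sum_{i \in A_k} (1 - p_i)^2 - \frac{1}{2} \Big( \sum_{i \in A_k} (1 - p_i) \Big)^2 \\ &= -\sum_{i, j \in A_k : i < j} (1 - p_i)(1 - p_j). \end{aligned} \]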

This proves part (i). Part (ii) is deduced immediately from (i) and (2.11). □ 
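
The approximation (2.9) can be compared numerically with exact inclusion probabilities obtained from (2.5). The sketch below is a minimal illustration for k = 3 on a hypothetical two-stratum population of our own choosing; the helper names are assumptions of this example, and unit indices are 0-based.

```python
from itertools import combinations

def poisson_binomial_pmf(p):
    """Return [P(K = 0), ..., P(K = len(p))] for K a sum of independent Bernoulli(p_i)."""
    pmf = [1.0]
    for p_i in p:
        nxt = [0.0] * (len(pmf) + 1)
        for size, mass in enumerate(pmf):
            nxt[size] += mass * (1.0 - p_i)
            nxt[size + 1] += mass * p_i
        pmf = nxt
    return pmf

def inclusion_probability(p, n, units):
    """Exact inclusion probability of the distinct units in `units`, via (2.5)."""
    units = set(units)
    k = len(units)
    rest = [p_j for j, p_j in enumerate(p) if j not in units]
    ratio = poisson_binomial_pmf(rest)[n - k] / poisson_binomial_pmf(p)[n]
    product = 1.0
    for i in units:
        product *= p[i]
    return product * ratio

# Hypothetical population (not from the paper): 200 units with p = 0.3 and 200 with p = 0.6.
p = [0.3] * 200 + [0.6] * 200
n = 180
d = sum(x * (1 - x) for x in p)

A = (0, 1, 200)                                        # k = 3 distinct units
first_order = {i: inclusion_probability(p, n, [i]) for i in A}
exact = inclusion_probability(p, n, A)

naive = 1.0
for i in A:
    naive *= first_order[i]                            # product of first order probabilities
correction = sum((1 - first_order[i]) * (1 - first_order[j])
                 for i, j in combinations(A, 2))
approx = naive * (1 - correction / d)                  # right-hand side of (2.9) without O(d^-2)
```

The plain product `naive` ignores the negative dependence between the indicators, whereas `approx` includes the first order correction of Theorem 1.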

3. Application: bounds on higher order correlations under rejective sampling 

Conditions on the order of higher order correlations, as $N \to \infty$, appear at several places in the
literature, see e.g., Breidt and Opsomer (2000), Breidt et al. (2007), Cardot et al. (2010) or Wang
(2009), among others. Such conditions are used when studying asymptotic properties in survey
sampling for general sampling designs, but they are difficult to check for more complex sampling
designs that go beyond simple random sampling without replacement. An attempt to provide
simpler conditions for rejective sampling can be found in Arratia, Goldstein and Langholz (2005).
They formulate an asymptotic stability condition on inclusion frequencies that ensures
bounds on general higher order correlations. The purpose of the present section is to explain
how Theorem 1 can be used to establish several bounds on higher order correlations for the
rejective sampling design. The bounds in Arratia, Goldstein and Langholz (2005) match the
ones that we find for correlations up to order four, which suffices for the conditions imposed
in Breidt and Opsomer (2000); Breidt et al. (2007); Cardot et al. (2010); Wang (2009). However,
in order to derive these bounds, we only need the simple requirement that

\[ \limsup_{N \to \infty} \frac{N}{d} < \infty, \tag{3.1} \]

where d is defined in (2.3). Moreover, one can show that (3.1) is weaker than the asymptotic
stability condition in Arratia, Goldstein and Langholz (2005), as detailed in Section 4.2.

Before we start a discussion on the assumptions on higher order correlations that appear for
example in Breidt and Opsomer (2000); Breidt et al. (2007); Cardot et al. (2010); Wang (2009),
first note that (3.1) necessarily yields that $d \to \infty$, which means that Theorem 1 holds. Moreover,
condition (3.1) has a number of additional consequences, such as $n \ge d \to \infty$, $N - n \ge d \to \infty$,
and

\[ \limsup_{N \to \infty} \frac{N}{n} \le \limsup_{N \to \infty} \frac{N}{d} < \infty. \tag{3.2} \]

A typical example of a condition on higher order correlations is

\[ \limsup_{N \to \infty} \, n \max_{(i,j) \in D_{2,N}} \left| E_P (I_i - \pi_i)(I_j - \pi_j) \right| < \infty, \tag{3.3} \]

where for every integer $t \ge 1$:

\[ D_{t,N} = \left\{ (i_1, i_2, \ldots, i_t) : i_1, i_2, \ldots, i_t \text{ are all different and each } i_j \in \{1, 2, \ldots, N\} \right\}. \tag{3.4} \]

Condition (3.3) is one of the assumptions in Breidt and Opsomer (2000) among others. Since
$E_P (I_i - \pi_i)(I_j - \pi_j) = \pi_{ij} - \pi_i \pi_j$, condition (3.3) immediately follows from Theorem 1 and (3.2).

Interestingly, the simple representation of the second order correlations as a difference of second
order inclusion probabilities and the product of first order inclusion probabilities can be
generalized to correlations of higher order, as made precise in the following lemma.

imsart-generic ver. 2011/11/15 file: Bolstard_Lopuhaa_Gazen_arxlv.tex date: July 25, 2012 



Boistard, Lopuhaä and Ruiz-Gazen / Approximation of rejective sampling inclusion probabilities






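Lemma 2. For any $k \ge 2$, let $A_k = \{i_1, i_2, \ldots, i_k\} \subset \{1, 2, \ldots, N\}$. Then

\[ E \prod_{j=1}^{k} (I_{i_j} - \pi_{i_j}) = \sum_{m=2}^{k} (-1)^{k-m} \sum_{(i_1, \ldots, i_m) \in D_{m,k}} \left( \pi_{i_1, \ldots, i_m} - \pi_{i_1} \cdots \pi_{i_m} \right) \pi_{i_{m+1}} \cdots \pi_{i_k}, \tag{3.5} \]

where $D_{m,k}$ is the set of distinct m-tuples in $A_k$ and $\{i_{m+1}, \ldots, i_k\} = A_k \setminus \{i_1, \ldots, i_m\}$.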
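
Identity (3.5) is purely algebraic, so it can be checked by brute force on a small design. In the sketch below, which is only an illustration, $D_{m,k}$ is read as the collection of subsets of m different indices of $A_k$ (the reading used in the counting argument of Section 4.3); the helper names and the six parameters are assumptions of this example.

```python
from itertools import combinations
from math import prod

def rejective_design(p, n):
    """Map each size-n sample (tuple of 0-based unit indices) to its probability under (2.1)."""
    N = len(p)
    weights = {s: prod(p[i] for i in s) * prod(1.0 - p[i] for i in range(N) if i not in s)
               for s in combinations(range(N), n)}
    total = sum(weights.values())
    return {s: w / total for s, w in weights.items()}

def joint_pi(design, units):
    """Inclusion probability of the set `units`, by summing over samples containing it."""
    units = set(units)
    return sum(prob for s, prob in design.items() if units <= set(s))

def lhs(design, A):
    """E prod_{i in A} (I_i - pi_i), computed directly from the design."""
    pi1 = {i: joint_pi(design, [i]) for i in A}
    return sum(prob * prod((1.0 if i in s else 0.0) - pi1[i] for i in A)
               for s, prob in design.items())

def rhs(design, A):
    """Right-hand side of (3.5), summing over subsets of A with m = 2, ..., k elements."""
    pi1 = {i: joint_pi(design, [i]) for i in A}
    k = len(A)
    total = 0.0
    for m in range(2, k + 1):
        for S in combinations(A, m):
            rest = [i for i in A if i not in S]
            term = (joint_pi(design, S) - prod(pi1[i] for i in S)) * prod(pi1[i] for i in rest)
            total += (-1) ** (k - m) * term
    return total

p = [0.2, 0.4, 0.5, 0.6, 0.7, 0.6]     # illustrative parameters summing to n = 3
design = rejective_design(p, 3)
```

Calling `lhs(design, A)` and `rhs(design, A)` for any set A of at least two distinct units should give the same value up to rounding.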



From this lemma, we can prove the following proposition that provides an expansion of higher 
order correlations for rejective sampling. 

Proposition 1. Consider a rejective sampling design. Then, for any $k \ge 3$ and any positive
integers $n_j$, j = 1, 2, ..., k,

\[ E \prod_{j=1}^{k} (I_{i_j} - \pi_{i_j})^{n_j} = O(d^{-2}) \tag{3.6} \]

as $d \to \infty$, where d is defined by (2.3).

The proofs of Lemma 2 and Proposition 1 are provided in Section 4.3. 

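Proposition 1 together with condition (3.2) implies that the following conditions, which appear for
example in Breidt and Opsomer (2000), are satisfied:

\[ \limsup_{N \to \infty} \frac{N^4}{n^2} \max_{(i,j,k,l) \in D_{4,N}} \left| E (I_i - \pi_i)(I_j - \pi_j)(I_k - \pi_k)(I_l - \pi_l) \right| < \infty, \]
\[ \limsup_{N \to \infty} \frac{N^3}{n^2} \max_{(i,j,k) \in D_{3,N}} \left| E (I_i - \pi_i)^2 (I_j - \pi_j)(I_k - \pi_k) \right| < \infty. \tag{3.7} \]

Other conditions on higher order correlations, such as

\[ \lim_{N \to \infty} \max_{(i,j,k,l) \in D_{4,N}} \left| E (I_i I_j - \pi_{ij})(I_k I_l - \pi_{kl}) \right| = 0, \tag{3.8} \]

which appears in Breidt and Opsomer (2000), can be treated in the same manner.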

The conditions in Breidt et al. (2007) and Cardot et al. (2010) on higher order correlations are
equivalent to the preceding ones, whereas the conditions in Wang (2009) differ somewhat, but they
are all satisfied for the rejective sampling design as consequences of Proposition 1.

4. Proofs 

4.1. Proof of Lemma 1

For the proof of Lemma 1, we use Edgeworth expansions for probabilities of sums of independent
random variables, as given in Theorem 6.2 in Hájek (1981). Let $K = I_1 + I_2 + \cdots + I_N$ be a sum of
independent Bernoulli random variables with parameters $p_1, p_2, \ldots, p_N$, and let $d = V(K)$. Then, for
$0 \le l \le N$ and $m \ge 1$,

\[ |P(K = l) - f_m(x)| = o(d^{-(m+1)/2}), \tag{4.1} \]

where $f_m(x)$ is the Edgeworth expansion of $P(K = l)$ up to order m, given by

\[ f_m(x) = d^{-1/2} \phi(x) \left\{ 1 + \sum_{j=1}^{m} P_j(x) \right\}, \quad \text{with } x = \frac{l - E(K)}{d^{1/2}}, \tag{4.2} \]

where $\phi$ denotes the standard normal density and each $P_j$ is a linear combination of (probabilistic)
Hermite polynomials involving the cumulants of K. Recall that the Hermite polynomials are
defined by

\[ H_k(x) = (-1)^k e^{x^2/2} \frac{d^k}{dx^k} e^{-x^2/2}, \tag{4.3} \]

for k = 0, 1, 2, ....
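
The values of the Hermite polynomials at zero are used repeatedly in the proof below. As a convenience, the following sketch generates the coefficients of $H_0, \ldots, H_6$ from the relation $H_{k+1}(x) = x H_k(x) - H_k'(x)$, which is what the definition (4.3) gives after differentiating once more. The function name and the list representation (lowest degree first) are choices made for this example.

```python
def hermite_polynomials(max_degree):
    """Coefficient lists (lowest degree first) of the probabilists' Hermite polynomials."""
    polys = [[1]]                                              # H_0(x) = 1
    for _ in range(max_degree):
        h = polys[-1]
        x_times_h = [0] + h                                    # coefficients of x * H_k(x)
        derivative = [i * c for i, c in enumerate(h)][1:]      # coefficients of H_k'(x)
        derivative += [0] * (len(x_times_h) - len(derivative)) # pad to equal length
        polys.append([a - b for a, b in zip(x_times_h, derivative)])
    return polys

H = hermite_polynomials(6)
H4_at_zero = H[4][0]     # constant term of H_4
H6_at_zero = H[6][0]     # constant term of H_6
```

The constant terms of the odd-degree polynomials vanish, which is the property $H_{2j+1}(0) = 0$ used in the proof of Lemma 1.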
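
Lemma 3. The polynomials $P_j$ in (4.2) can be expressed as:

\[ P_j(x) = \sum_{\{k_m\}} H_{j+2r}(x) \prod_{m=1}^{j} \frac{1}{k_m!} \left( \frac{\kappa_{m+2}}{(m+2)! \, d^{(m+2)/2}} \right)^{k_m}, \tag{4.4} \]

where the sum is taken over all sets $\{k_m\}$ consisting of all non-negative integer solutions of

\[ k_1 + 2 k_2 + \cdots + j k_j = j, \tag{4.5} \]

and r is defined by $k_1 + k_2 + \cdots + k_j = r$, and where $\kappa_m$ is the m-th cumulant of K and $H_{j+2r}$ is
the Hermite polynomial of degree j + 2r as given in (4.3).

Proof. The proof relies on the Edgeworth expansion of $P(K = l)$, e.g., see (43) in Blinnikov and Moessner
(1998),

\[ P(K = l) = d^{-1/2} \phi(x) \left\{ 1 + \sum_{j=1}^{\infty} d^{j/2} \sum_{\{k_m\}} H_{j+2r}(x) \prod_{m=1}^{j} \frac{1}{k_m!} \left( \frac{S_{m+2}}{(m+2)!} \right)^{k_m} \right\}, \]

where $x = (l - E_P(K)) d^{-1/2}$ and $S_m = \kappa_m / d^{m-1}$. This means that

\[ P_j(x) = d^{j/2} \sum_{\{k_m\}} H_{j+2r}(x) \prod_{m=1}^{j} \frac{1}{k_m!} \left( \frac{S_{m+2}}{(m+2)!} \right)^{k_m}. \]

Note that

\[ \prod_{m=1}^{j} S_{m+2}^{k_m} = \prod_{m=1}^{j} \left( \frac{\kappa_{m+2}}{d^{m+1}} \right)^{k_m} = d^{-j/2} \prod_{m=1}^{j} \left( \frac{\kappa_{m+2}}{d^{(m+2)/2}} \right)^{k_m}, \]

according to (4.5). This yields (4.4). □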
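
The index sets $\{k_m\}$ in Lemma 3 are the non-negative integer solutions of (4.5). The short sketch below enumerates them and returns, for each solution, the degree j + 2r of the Hermite polynomial it contributes; the function names are ours and serve only as an illustration.

```python
def index_sets(j):
    """All tuples (k_1, ..., k_j) of non-negative integers with k_1 + 2 k_2 + ... + j k_j = j."""
    solutions = []

    def extend(m, remaining, prefix):
        if m > j:
            if remaining == 0:
                solutions.append(tuple(prefix))
            return
        for k_m in range(remaining // m + 1):
            extend(m + 1, remaining - m * k_m, prefix + [k_m])

    extend(1, j, [])
    return solutions

def hermite_degree(ks):
    """Degree j + 2r of the Hermite polynomial attached to the solution ks, with r = sum of ks."""
    return len(ks) + 2 * sum(ks)

solutions_for_j2 = index_sets(2)
```

For j = 2 the two solutions correspond to the $H_6$ and $H_4$ terms that appear in the expression of $P_2(0)$ in the proof of Lemma 1.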

The next lemma shows that the cumulants of the sum of independent Bernoulli variables are 
of the same order as the variance. 

Lemma 4. Let $K = I_1 + I_2 + \cdots + I_N$ be a sum of independent Bernoulli random variables with
parameters $p_1, p_2, \ldots, p_N$. Let $d = V(K) = \sum_{i=1}^{N} p_i (1 - p_i)$. Then, for any positive integer m, we
have $\kappa_m = O(d)$, as $d \to \infty$, uniformly in $p_1, p_2, \ldots, p_N$.

Proof. Recall that the cumulants of a random variable X are defined as the coefficients in the
expansion of the logarithm of the moment-generating function, i.e., if

\[ g(t) = \log E(e^{tX}) = \sum_{m=1}^{\infty} \kappa_m \frac{t^m}{m!}, \]

the m-th cumulant is $\kappa_m = g^{(m)}(0)$. This implies that the m-th cumulant $\kappa_m$ of the sum of
independent Bernoulli random variables is equal to the sum of the m-th cumulants $\varepsilon_m$ of the
individual Bernoulli variables. Moreover, we have the following recurrence relation between the
cumulants of a single Bernoulli variable with parameter p:

\[ \varepsilon_{m+1} = p (1 - p) \frac{d \varepsilon_m}{dp}, \tag{4.6} \]

see for instance, example (c) in Section 4 in Khatri (1959). It is straightforward to see that
$\kappa_1 = p_1 + p_2 + \cdots + p_N$ and $\kappa_2 = \sum_{i=1}^{N} p_i (1 - p_i) = d$. It can be proved by induction, using (4.6),
that $\varepsilon_m = p (1 - p) R_m(p)$, where $R_m$ is a polynomial with degree less than or equal to m - 1 and
with coefficients depending only on m. Thus, $\kappa_m = d \, Q_m(p)$, where $Q_m(p)$ is of the form

\[ Q_m(p) = \frac{\sum_{i=1}^{N} p_i (1 - p_i) R_m(p_i)}{\sum_{i=1}^{N} p_i (1 - p_i)} \]

and is bounded uniformly in $p_1, p_2, \ldots, p_N$. This proves the lemma. □
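
The recurrence (4.6) is easy to iterate symbolically. The sketch below represents each Bernoulli cumulant as a polynomial in p (coefficient list, lowest degree first) and applies (4.6) repeatedly; the function names are assumptions of this illustration.

```python
def bernoulli_cumulant_polys(max_order):
    """Coefficient lists of eps_1, ..., eps_max_order as polynomials in p, built from (4.6)."""
    polys = [[0, 1]]                                         # eps_1 = p
    for _ in range(max_order - 1):
        e = polys[-1]
        deriv = [i * c for i, c in enumerate(e)][1:]         # d eps_m / dp
        nxt = [0] * (len(deriv) + 2)
        for i, c in enumerate(deriv):                        # multiply by p (1 - p) = p - p^2
            nxt[i + 1] += c
            nxt[i + 2] -= c
        polys.append(nxt)
    return polys

def evaluate(poly, p):
    """Value of the polynomial with coefficient list `poly` at the point p."""
    return sum(c * p ** i for i, c in enumerate(poly))

eps = bernoulli_cumulant_polys(4)   # eps[m - 1] holds the m-th cumulant
```

The third and fourth entries factor as p(1-p)(1-2p) and p(1-p)(1-6p(1-p)), which are the expressions behind the third and fourth cumulants of K used in the proof of Lemma 1.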
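
Proof of Lemma 1. We use (4.1) with m = 4. Because $E_P(K) = n$, formula (4.2) is used with
x = 0. To make the expressions of $P_j(0)$ precise, for j = 1, 2, 3, 4, we use Lemma 3. It follows from (4.3)
that the Hermite polynomials satisfy the following recurrence relationship

\[ H_{k+1}(x) = -e^{x^2/2} \frac{d}{dx} \left[ H_k(x) e^{-x^2/2} \right]. \tag{4.7} \]

By induction it follows from (4.7) that for any integer j = 0, 1, ..., the Hermite polynomials $H_{2j}$
and $H_{2j+1}$ are of the form

\[ H_{2j}(x) = a_{0j} + a_{1j} x^2 + \cdots + a_{jj} x^{2j}, \]
\[ H_{2j+1}(x) = b_{0j} x + b_{1j} x^3 + \cdots + b_{jj} x^{2j+1}. \]

It follows that $H_{2j+1}(0) = 0$, for any integer j. Combining this with Lemmas 3 and 4, we can
see that $P_{2j+1}(0) = 0$ and $P_{2j}(0) = O(d^{-j})$ for any integer j. Thus, $P_1(0) = P_3(0) = 0$ and
$P_4(0) = O(d^{-2})$. Moreover,

\[ P_2(0) = \frac{H_6(0)}{2! (3!)^2} \frac{\kappa_3^2}{d^3} + \frac{H_4(0)}{4!} \frac{\kappa_4}{d^2} = -\frac{15}{72} \frac{\kappa_3^2}{d^3} + \frac{3}{24} \frac{\kappa_4}{d^2}. \]

Finally, from (4.6) one can easily deduce that $\kappa_3 = d (1 - 2\bar{p})$ and $\kappa_4 = d (1 - 6\,\overline{p(1-p)})$. We then
obtain:

\[ \begin{aligned} P(K = n) &= d^{-1/2} \phi(0) \left\{ 1 + \sum_{j=1}^{4} P_j(0) \right\} + o(d^{-5/2}) \\ &= (2\pi d)^{-1/2} \left\{ 1 - \frac{5}{24} (1 - 2\bar{p})^2 d^{-1} + \frac{1}{8} \left( 1 - 6\,\overline{p(1-p)} \right) d^{-1} + O(d^{-2}) \right\} \\ &= (2\pi d)^{-1/2} \left\{ 1 + c_1 d^{-1} + O(d^{-2}) \right\}. \end{aligned} \]

For the expansion of $P(K = n \mid I_{i_1} = \cdots = I_{i_k} = 1)$, let $E_k$ denote the event $\{I_j = 1, \text{ for all } j \in A_k\}$
and define the random variable $\tilde{K} = K \mid E_k$. Note that it can be written as the sum of independent
Bernoulli variables,

\[ \tilde{K} = \sum_{j \notin A_k} I_j + \sum_{j \in A_k} I_j^*, \]

where $I_j^* = 1$. Thus, we can write an Edgeworth expansion for $\tilde{K}$ as stated in (4.1). Since

\[ \begin{aligned} E(\tilde{K}) &= \sum_{j \notin A_k} p_j + k = n + k - \sum_{j \in A_k} p_j = n + k - B_1, \\ V(\tilde{K}) &= \sum_{j \notin A_k} p_j (1 - p_j) = d - \sum_{j \in A_k} p_j (1 - p_j) = d - B_2, \end{aligned} \tag{4.8} \]

with $\tilde{d} = d - B_2$, the expansion is as follows

\[ P(\tilde{K} = n) = \tilde{d}^{-1/2} \phi(\tilde{x}) \left\{ 1 + \sum_{j=1}^{4} P_j^*(\tilde{x}) \right\} + o(\tilde{d}^{-5/2}), \quad \text{with } \tilde{x} = \frac{n - E(\tilde{K})}{\tilde{d}^{1/2}}, \]

where the $P_j^*$'s are the polynomials given in (4.4) corresponding to $\tilde{K}$.

Let us first compute an expansion for $\tilde{d}^{-1/2} \phi(\tilde{x})$. We start with the expansion of $\tilde{d}^{-1/2}$:

\[ \tilde{d}^{-1/2} = (d - B_2)^{-1/2} = d^{-1/2} \left\{ 1 + \frac{1}{2} B_2 d^{-1} + O(d^{-2}) \right\}. \tag{4.9} \]

Next, remark that

\[ \tilde{x} = (d - B_2)^{-1/2} (B_1 - k) = d^{-1/2} (B_1 - k) \left\{ 1 + \frac{1}{2} B_2 d^{-1} + O(d^{-2}) \right\}, \tag{4.10} \]

so that

\[ \phi(\tilde{x}) = (2\pi)^{-1/2} \left\{ 1 - \frac{1}{2} \tilde{x}^2 + O(\tilde{x}^4) \right\} = (2\pi)^{-1/2} \left\{ 1 - \frac{1}{2} (B_1 - k)^2 d^{-1} + O(d^{-2}) \right\}. \]

Together with (4.9), this gives

\[ \tilde{d}^{-1/2} \phi(\tilde{x}) = (2\pi d)^{-1/2} \left\{ 1 + a_1 d^{-1} + O(d^{-2}) \right\}, \tag{4.11} \]

where $a_1 = (B_2 - (B_1 - k)^2)/2$. Finally, we compute $P_j^*(\tilde{x})$, for j = 1, 2, 3, 4. First, let us compute
the third and fourth cumulants of $\tilde{K}$. We have that

\[ \tilde{\kappa}_3 = \kappa_3 - \sum_{j \in A_k} p_j (1 - p_j)(1 - 2 p_j) = \kappa_3 - B_3, \]
\[ \tilde{\kappa}_4 = \kappa_4 - \sum_{j \in A_k} p_j (1 - p_j)(1 - 6 p_j + 6 p_j^2) = \kappa_4 - B_4, \]

with $B_3$ and $B_4$ ad hoc constants. Thus, by Lemmas 3 and 4 with (4.9) and (4.10),

\[ P_1^*(\tilde{x}) = \frac{\tilde{\kappa}_3}{3! \, \tilde{d}^{3/2}} H_3(\tilde{x}) = -\frac{\tilde{\kappa}_3}{2 \tilde{d}^{3/2}} \left( \tilde{x} + O(\tilde{x}^3) \right) = -\frac{1}{2} (B_1 - k)(1 - 2\bar{p}) d^{-1} \left( 1 + O(d^{-1}) \right), \]

and likewise

\[ P_2^*(\tilde{x}) = \frac{H_6(\tilde{x})}{2! (3!)^2} \frac{\tilde{\kappa}_3^2}{\tilde{d}^3} + \frac{H_4(\tilde{x})}{4!} \frac{\tilde{\kappa}_4}{\tilde{d}^2} = \left\{ -\frac{5}{24} (1 - 2\bar{p})^2 + \frac{1}{8} \left( 1 - 6\,\overline{p(1-p)} \right) \right\} d^{-1} \left( 1 + O(d^{-1}) \right). \]

Moreover, similarly to Lemma 4, one has $\tilde{\kappa}_m = O(\tilde{d})$, for any positive integer m. Hence, for any
integer j, $P_{2j}^*(\tilde{x}) = O(d^{-j})$ and $P_{2j+1}^*(\tilde{x}) = O(d^{-(j+1)})$, so that $P_3^*(\tilde{x}) = O(d^{-2})$ and
$P_4^*(\tilde{x}) = O(d^{-2})$. It follows that

\[ 1 + \sum_{j=1}^{4} P_j^*(\tilde{x}) = 1 + c_1^* d^{-1} + O(d^{-2}), \tag{4.12} \]

where

\[ c_1^* = -\frac{1}{2} (B_1 - k)(1 - 2\bar{p}) - \frac{5}{24} (1 - 2\bar{p})^2 + \frac{1}{8} \left( 1 - 6\,\overline{p(1-p)} \right) = -\frac{1}{2} (B_1 - k)(1 - 2\bar{p}) + c_1. \]

Combining (4.11) and (4.12), so that $c_2 = a_1 + c_1^*$, proves the lemma. □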

4.2. Comparison with assumptions in Arratia et al.

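In Arratia, Goldstein and Langholz (2005), the following condition is used for rejective sampling.
For all $\delta \in (0, 1)$, there exists $\varepsilon \in (0, 1)$, such that

\[ \limsup_{N \to \infty} \frac{1}{N} \sum_{i=1}^{N} 1\left\{ \frac{\varepsilon}{1 + \varepsilon} < p_i < \frac{1}{1 + \varepsilon} \right\} \ge 1 - \delta. \tag{4.13} \]

This condition implies our condition (3.1), because

\[ d = \sum_{i=1}^{N} p_i (1 - p_i) \ge N (1 - \delta) \frac{\varepsilon}{1 + \varepsilon} \left( 1 - \frac{1}{1 + \varepsilon} \right) \ge N A > 0, \]

where $A = (1 - \delta)(\varepsilon / (1 + \varepsilon))^2 \in (0, 1)$.

However, our condition is weaker, in the sense that we can construct an example which satisfies
(3.1), but not (4.13). To this end, suppose that $n/N \to \gamma \in (0, 1)$. Take $\delta \in (0, 1)$, such that
$0 < \gamma < 1 - \delta < 1$. Furthermore, choose $\alpha \in (0, 1)$, such that $0 < \gamma < \alpha < 1 - \delta < 1$, and let
$k = \alpha N$. Then define

\[ p_1 = \cdots = p_k = \frac{\gamma}{\alpha} \quad \text{and} \quad p_{k+1} = \cdots = p_N = \delta_N = \frac{n/N - \gamma}{1 - \alpha} \to 0. \]

First note that this choice is possible in rejective sampling, since

\[ p_1 + \cdots + p_N = k \times \frac{\gamma}{\alpha} + (N - k) \times \delta_N = N \gamma + N (1 - \alpha) \frac{n/N - \gamma}{1 - \alpha} = n. \]

With these probabilities, condition (4.13) is not satisfied for any $\varepsilon \in (0, 1)$, because for N sufficiently
large $p_{k+1} = \cdots = p_N < \varepsilon / (1 + \varepsilon)$, so that

\[ \frac{1}{N} \sum_{i=1}^{N} 1\left\{ \frac{\varepsilon}{1 + \varepsilon} < p_i < \frac{1}{1 + \varepsilon} \right\} \le \frac{k}{N} = \alpha < 1 - \delta, \]

whereas condition (3.1) is fulfilled, as

\[ \frac{d}{N} = \frac{1}{N} \sum_{i=1}^{N} p_i (1 - p_i) = \frac{n}{N} - \frac{k}{N} \left( \frac{\gamma}{\alpha} \right)^2 - \frac{N - k}{N} \delta_N^2 = \frac{n}{N} - \frac{\gamma^2}{\alpha} - (1 - \alpha) \delta_N^2 \to \gamma - \frac{\gamma^2}{\alpha} > A, \]

where $A = (\gamma - \gamma^2 / \alpha)/2 \in (0, 1)$.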
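
The construction above can be checked numerically for a given population size. The sketch below uses hypothetical values of $\gamma$, $\alpha$, $\delta$, $\varepsilon$, N and n chosen for this illustration only (they are not taken from the paper), with n/N slightly above $\gamma$ so that $\delta_N$ is positive.

```python
gamma, alpha, delta = 0.3, 0.5, 0.4       # hypothetical values with gamma < alpha < 1 - delta
N, n = 1000, 301                          # n / N is slightly above gamma
k = int(alpha * N)
delta_N = (n / N - gamma) / (1 - alpha)   # small probability given to the last N - k units
p = [gamma / alpha] * k + [delta_N] * (N - k)

eps = 0.1                                 # one illustrative choice of epsilon in (0, 1)
lower, upper = eps / (1 + eps), 1 / (1 + eps)
fraction_in_band = sum(lower < x < upper for x in p) / N   # left-hand side of (4.13) at this N

d = sum(x * (1 - x) for x in p)
A = (gamma - gamma ** 2 / alpha) / 2      # lower bound for d / N used in the text
```

Here only the first k units fall in the band, so `fraction_in_band` stays at $\alpha$, below $1 - \delta$, while `d / N` remains above `A`.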



4.3. Proofs of Lemma 2 and Proposition 1

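Proof of Lemma 2. We decompose the product in the following way:

\[ \begin{aligned} E \prod_{j=1}^{k} (I_{i_j} - \pi_{i_j}) &= \pi_{i_1} \pi_{i_2} \cdots \pi_{i_k} (-1)^k + E \sum_{m=1}^{k} \sum_{D_{m,k}} I_{i_1} I_{i_2} \cdots I_{i_m} \pi_{i_{m+1}} \cdots \pi_{i_k} (-1)^{k-m} \\ &= \pi_{i_1} \pi_{i_2} \cdots \pi_{i_k} (-1)^k + \sum_{m=1}^{k} \sum_{D_{m,k}} \pi_{i_1, i_2, \ldots, i_m} \pi_{i_{m+1}} \cdots \pi_{i_k} (-1)^{k-m} \\ &= \sum_{m=1}^{k} \sum_{D_{m,k}} \left( \pi_{i_1, i_2, \ldots, i_m} - \pi_{i_1} \pi_{i_2} \cdots \pi_{i_m} \right) \pi_{i_{m+1}} \cdots \pi_{i_k} (-1)^{k-m} \\ &\quad + \pi_{i_1} \pi_{i_2} \cdots \pi_{i_k} (-1)^k + \sum_{m=1}^{k} \sum_{D_{m,k}} \pi_{i_1} \cdots \pi_{i_m} \pi_{i_{m+1}} \cdots \pi_{i_k} (-1)^{k-m}. \end{aligned} \]

The term with m = 1 in the first sum vanishes. The last two terms on the right hand side are equal to

\[ \begin{aligned} \pi_{i_1} \pi_{i_2} \cdots \pi_{i_k} (-1)^k + \sum_{m=1}^{k} \sum_{D_{m,k}} \pi_{i_1} \cdots \pi_{i_k} (-1)^{k-m} &= \pi_{i_1} \pi_{i_2} \cdots \pi_{i_k} (-1)^k + \pi_{i_1} \pi_{i_2} \cdots \pi_{i_k} \sum_{m=1}^{k} \binom{k}{m} (-1)^{k-m} \\ &= \pi_{i_1} \pi_{i_2} \cdots \pi_{i_k} \sum_{m=0}^{k} \binom{k}{m} (-1)^{k-m} = \pi_{i_1} \cdots \pi_{i_k} (1 - 1)^k = 0. \end{aligned} \]

□

Proof of Proposition 1. The proof is by induction on the powers $n_j$. We first prove that

\[ E \prod_{j=1}^{k} (I_{i_j} - \pi_{i_j}) = O(d^{-2}), \tag{4.14} \]

for any $A_k = \{i_1, i_2, \ldots, i_k\} \subset \{1, 2, \ldots, N\}$, with $3 \le k \le N$, and then add an extra power one by
one. From Lemma 2, we have that

\[ E \prod_{j=1}^{k} (I_{i_j} - \pi_{i_j}) = \sum_{m=2}^{k} (-1)^{k-m} \sum_{(i_1, \ldots, i_m) \in D_{m,k}} \left( \pi_{i_1, \ldots, i_m} - \pi_{i_1} \cdots \pi_{i_m} \right) \pi_{i_{m+1}} \cdots \pi_{i_k}, \]

where $\{i_{m+1}, \ldots, i_k\} = A_k \setminus \{i_1, \ldots, i_m\}$. From Theorem 1, we have that

\[ \pi_{i_1, \ldots, i_m} - \pi_{i_1} \cdots \pi_{i_m} = -d^{-1} \pi_{i_1} \cdots \pi_{i_m} \sum_{i < j} (1 - \pi_i)(1 - \pi_j) + O(d^{-2}), \]

where the sum runs over all i < j, such that $i, j \in \{i_1, \ldots, i_m\}$. This means that

\[ E \prod_{j=1}^{k} (I_{i_j} - \pi_{i_j}) = -d^{-1} \pi_{i_1} \cdots \pi_{i_k} \sum_{m=2}^{k} (-1)^{k-m} \sum_{(i_1, \ldots, i_m) \in D_{m,k}} \sum_{i < j} (1 - \pi_i)(1 - \pi_j) + O(d^{-2}). \]

For $2 \le m \le k$ fixed, consider the summation

\[ \sum_{(i_1, \ldots, i_m) \in D_{m,k}} \sum_{i < j} (1 - \pi_i)(1 - \pi_j). \]

The first summation is over all possible $(i_1, \ldots, i_m) \in D_{m,k}$, which are all possible combinations
of m different indices from $A_k = \{i_1, i_2, \ldots, i_k\}$. From each such combination $i_1, \ldots, i_m$, the
second summation picks two different indices i < j from the set $\{i_1, \ldots, i_m\}$. This means that any
combination of $(1 - \pi_i)(1 - \pi_j)$, with $\{i, j\} \subset A_k$, is possible. In fact, each such combination will
appear several times and we only have to count how many times. For a fixed combination
$(i_1, \ldots, i_m)$ from the k possibilities in $A_k$, we need to pick i and j, and for the m - 2 remaining choices
there are k - 2 possibilities left. We conclude that each term $(1 - \pi_i)(1 - \pi_j)$, with $\{i, j\} \subset A_k$,
appears $\binom{k-2}{m-2}$ times. Moreover, this holds for any m = 2, 3, ..., k. This means that

\[ \sum_{m=2}^{k} (-1)^{k-m} \sum_{(i_1, \ldots, i_m) \in D_{m,k}} \sum_{i < j} (1 - \pi_i)(1 - \pi_j) = \sum_{\{i,j\} \subset A_k} (1 - \pi_i)(1 - \pi_j) \sum_{m=2}^{k} (-1)^{k-m} \binom{k-2}{m-2}, \]

where, since $k \ge 3$,

\[ \sum_{m=2}^{k} (-1)^{k-m} \binom{k-2}{m-2} = \sum_{n=0}^{k-2} (-1)^{k-2-n} \binom{k-2}{n} = (1 - 1)^{k-2} = 0. \]

We conclude that the coefficient of $d^{-1}$ is zero, which proves (4.14).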
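
The two counting facts used in this argument can be verified mechanically for small k. The sketch below is an illustration with function names of our own: it counts, by enumeration, the m-subsets of a k-set that contain a fixed pair, and evaluates the alternating binomial sum.

```python
from itertools import combinations
from math import comb

def pair_count(k, m, pair=(0, 1)):
    """Number of m-subsets of {0, ..., k-1} that contain both elements of `pair`."""
    return sum(1 for S in combinations(range(k), m)
               if pair[0] in S and pair[1] in S)

def alternating_sum(k):
    """Sum over m = 2, ..., k of (-1)^(k-m) * C(k-2, m-2)."""
    return sum((-1) ** (k - m) * comb(k - 2, m - 2) for m in range(2, k + 1))

count_k6_m4 = pair_count(6, 4)          # should agree with comb(4, 2)
sums = [alternating_sum(k) for k in range(3, 10)]
```

The alternating sum equals one for k = 2, which is why the first order term survives for second order inclusion probabilities in (2.2) but cancels for products of three or more centered indicators.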

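Next, suppose that the expectation is of order $O(d^{-2})$ for all powers $1 \le m_j \le n_j$, and consider

\[ E \left[ (I_{i_1} - \pi_{i_1})^{n_1 + 1} (I_{i_2} - \pi_{i_2})^{n_2} \cdots (I_{i_k} - \pi_{i_k})^{n_k} \right]. \]

This can be written as

\[ \begin{aligned} & E \left[ I_{i_1} (I_{i_1} - \pi_{i_1})^{n_1} (I_{i_2} - \pi_{i_2})^{n_2} \cdots (I_{i_k} - \pi_{i_k})^{n_k} \right] - \pi_{i_1} E \left[ (I_{i_1} - \pi_{i_1})^{n_1} (I_{i_2} - \pi_{i_2})^{n_2} \cdots (I_{i_k} - \pi_{i_k})^{n_k} \right] \\ &\quad = E \left[ I_{i_1} (I_{i_1} - \pi_{i_1})^{n_1} (I_{i_2} - \pi_{i_2})^{n_2} \cdots (I_{i_k} - \pi_{i_k})^{n_k} \right] + O(d^{-2}) \end{aligned} \]

according to the induction hypothesis. Next, write

\[ \begin{aligned} I_{i_1} (I_{i_1} - \pi_{i_1})^{n_1} &= (1 - \pi_{i_1}) I_{i_1} (I_{i_1} - \pi_{i_1})^{n_1 - 1} \\ &= (1 - \pi_{i_1}) (I_{i_1} - \pi_{i_1})^{n_1} + (1 - \pi_{i_1}) \pi_{i_1} (I_{i_1} - \pi_{i_1})^{n_1 - 1}. \end{aligned} \]

When we insert this, we find

\[ \begin{aligned} & E \left[ (I_{i_1} - \pi_{i_1})^{n_1 + 1} (I_{i_2} - \pi_{i_2})^{n_2} \cdots (I_{i_k} - \pi_{i_k})^{n_k} \right] \\ &\quad = (1 - \pi_{i_1}) E \left[ (I_{i_1} - \pi_{i_1})^{n_1} (I_{i_2} - \pi_{i_2})^{n_2} \cdots (I_{i_k} - \pi_{i_k})^{n_k} \right] \\ &\qquad + (1 - \pi_{i_1}) \pi_{i_1} E \left[ (I_{i_1} - \pi_{i_1})^{n_1 - 1} (I_{i_2} - \pi_{i_2})^{n_2} \cdots (I_{i_k} - \pi_{i_k})^{n_k} \right] + O(d^{-2}) \\ &\quad = O(d^{-2}) \end{aligned} \]

by applying the induction hypothesis.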

□ 